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ABSTRACT 

We  present  the  design  of  the  Snackbot,  a  robot  that  will  deliver 
snacks  in  our  university  buildings.  The  robot  is  intended  to 
provide  a  useful,  continuing  service  and  to  serve  as  a  research 
platform  for  long-term  Human-Robot  Interaction.  Our  design 
process,  which  occurred  over  24  months,  is  documented  as  a 
contribution  for  others  in  HRI  who  may  be  developing  social 
robots  that  offer  services.  We  describe  the  phases  of  the  design 
project,  and  the  design  decisions  and  tradeoffs  that  led  to  the 
current  version  of  the  robot. 
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I.  INTRODUCTION 

Experimental  systems,  including  receptionists,  assistants,  guides, 
tutors,  and  social  companions,  have  been  developed  as  platforms 
for  research  and  technology  development  [3]  [4]  [5]  [6]  [8] 
[10][11][13][14][18][22][26][30][32][34][36].  Much  of  this  work 
has  taken  place  in  the  research  laboratory,  but  a  few  systems  have 
made  the  successful  transition  to  real  world  settings  such  as 
museums  and  educational  institutions  [16][18][23][28][29].  Real 
world  settings  raise  the  bar  to  fluid,  natural  interaction  with 
robotic  systems. 

Robots  in  real  settings  also  need  to  interact  with  people 
appropriately.  Safe  interactions  are  necessary  to  contribute  to 
ethical  research  in  the  field,  to  improve  people’s  trust  in  and 
comfort  with  robotic  technology,  and  to  ensure  safety  and 
reliability  for  all  who  come  into  contact  with  this  technology. 
Socially  appropriate  interaction  behaviors  are  needed  so  people 
like  the  robot  and  are  interested  in  interacting  with  it  over  time. 
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Figure  1.  The  Snackbot  robot. 

Our  research  group  seeks  to  develop  robots  that  travel  around  and 
near  people,  and  that  support  them  in  real-world  environments. 
We  are  interested  in  developing  robots  that  act  as  social  assistants, 
with  the  ability  to  use  speech  and  gesture,  and  engage  people  in  a 
social  manner.  A  major  goal  is  to  create  mobile  robots  that 
interact  with  people  over  a  period  of  time,  performing  a  service. 

Many  questions  about  long-term  HRI  are  unanswered.  How  do 
people’s  perception  and  attitudes  towards  a  robot  evolve  over 
time?  What  interaction  design  strategies  will  reinforce  a  positive 
long-term  relationship  between  people  and  a  robot?  Will 
employees  use  a  robot  in  the  way  that  designers  intended  or  will 
they  appropriate  the  robot  in  new  ways,  as  has  happened  with 
other  technologies  [9]?  Could  robots  deliver  services  that  are 
beneficial  to  people  over  the  long  term?  How  should  robotic 
products  and  services  be  designed? 

To  address  these  challenges,  we  designed  and  developed  the 
Snackbot,  a  robust  robot  that  will  roam  semi-autonomously  in 
campus  buildings,  offering  snacks  to  office  residents  and 
passersby  (Figure  1).  We  designed  the  Snackbot  not  just  as  a 
service,  but  also  as  a  research  platform  to  investigate  questions 
related  to  long-term  interaction  with  social  robots. 
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Our  future  work  with  the  Snackbot  will  involve  field  trials  with 
the  robot  in  its  actual  context  of  use.  Such  research  poses  several 
technical,  interaction,  and  design  challenges.  First,  the  robot  must 
be  robust  and  powerful  enough  to  operate  autonomously  and 
interact  with  multiple  users  for  extended  periods  of  time.  The 
technology  should  also  be  flexible  enough  to  accommodate 
technical  improvements  and  new  applications.  To  test  different 
approaches  to  human-robot  interaction  over  time,  researchers 
should  be  able  to  manipulate  aspects  of  the  robot’s  physical 
appearance  and  behavior.  We  are  particularly  interested  in  how  a 
robot  delivers  a  service  after  the  initial  novelty  effect  has  worn 
off. 

In  this  paper,  we  present  our  design  process  for  the  Snackbot, 
shaped  by  our  initial  design  goals,  constraints  we  discovered 
along  the  way,  and  design  decisions  guided  by  interim  empirical 
studies.  We  document  this  process  as  a  contribution  for  others  in 
HRI  who  may  be  developing  social  robots  that  offer  services. 

2.  CONTEXT  OF  USE 

Robotic  advances  are  being  directed  towards  special  populations, 
including  elders,  those  with  physical  and  cognitive  disabilities, 
and  others.  We  want  to  design  robots  that  can  interact  with  almost 
everyone,  regardless  of  any  dispositions  to  using  technology.  To 
satisfy  this  goal,  we  are  interested  in  how  a  robot  can  deliver  a 
service  within  a  work  environment. 

We  chose  to  design  a  robot  that  would  provide  snack  deliveries  in 
the  two  connected  buildings  in  which  we  work.  By  “snack”  we 
mean  light  food  eaten  between  meals.  Snacks  include  “junk  food” 
such  as  food  offered  in  vending  machines,  and  “healthy”  snacks 
such  as  fruit  and  nuts.  Snacking  is  practiced  by  a  majority  of 
people  in  the  developed  world  [1][25][33].  In  workplaces,  people 
snack  in  their  offices  and  labs  as  well  as  in  halls,  cafeterias,  and 
food  vending  areas. 

A  robot  delivering  snacks  must  have  a  wide  range  of  mobility. 
The  buildings  are  large,  ranging  between  4  and  8  floors.  About 
1000  people  work  or  visit  these  buildings  each  day.  Because  the 
buildings  offer  only  prepackaged  snacks  in  convenient  locations, 
we  felt  a  snack  service  that  offered  higher  quality  snacks  would 
be  a  useful  application  for  a  long-term  product  and  service  in 
these  buildings.  Most  snacks  that  do  exist  are  highly  caloric,  and 
the  robot  could  include  healthier  snacks  in  its  offerings.  We  felt 
many  technical  and  design  research  questions  could  be  discovered 
in  understanding  how  a  robotic  snack  service  might  succeed 
within  the  social  and  environmental  context  of  our  buildings,  how 
it  would  differ  from  traditional  vendors  and  vending  machines, 
and  how  it  could  support  people’s  goals  such  as  taking  a  break 
from  work  and  delivering  snacks  as  gifts  to  people.  We  have 
described  some  of  the  research  supporting  these  decisions  in  a 
separate  paper  [19].  This  research,  combined  with  our  overall 
research  goals  in  HRI,  led  to  the  three  design  goals  that  anchored 
our  design  process. 

3.  DESIGN  GOALS 

We  had  three  design  goals  for  development  of  the  Snackbot  robot: 

The  first  was  to  develop  the  robot  holistically.  Rather  than 
advancing  technology  per  se  or  focusing  on  one  aspect  of  design 
or  interaction,  such  as  a  dialogue  system,  we  took  a  design 
approach  that  considered  the  robot  at  a  human-robot-context 
systems  level  [24].  Such  an  approach  allowed  us  to  think  about 
the  emergent  qualities  of  the  product  and  service,  which  might  not 


be  recognized  if  the  system  were  analyzed  in  component  parts 
rather  than  holistically. 

The  second  goal  was  to  simultaneously  develop  a  robotic  product 
and  service.  By  this  we  mean  that  the  robot  as  a  product  would 
have  to  be  more  than  sociable  and  attractive;  it  would  need  to 
deliver  something  useful  to  people.  We  adopted  this  goal  to 
increase  the  likelihood  that  people  would  continue  to  be  interested 
in  interacting  with  the  robot  over  a  period  of  time  [21].  By 
developing  a  snack  delivery  service  that  worked  with  wireless 
service  points  in  the  building,  we  could  collect  and  record 
knowledge  about  people’s  snack  preferences,  and  use  these  to 
further  enhance  the  service  we  provide  to  them. 

The  third  goal  was  to  develop  interaction  designs  that  would  help 
to  evoke  social  behavior.  Because  the  robot  was  meant  to  serve  as 
a  research  platform  that  would  be  used  by  people  over  time, 
decisions  about  functions  and  features  were  made  supporting  the 
interest  of  promoting  sociability.  For  instance,  we  aimed  to  have 
the  robot  interact  with  people  using  natural  language.  Other 
research  has  shown  that  people  interact  with  a  robot  longer  when 
it  exhibits  social  cues  [5]  [12].  Other  aspects  of  sociability  that  we 
plan  to  explore  and  extend  include  personalization  of  the  service, 
and  robot  politeness  and  non-verbal  behaviors  [2]. 

4.  SNACKBOT  TEAM 

Developing  a  robot  in  a  holistic  manner  required  interdisciplinary 
collaboration.  The  Snackbot  team  consisted  of  5  faculty,  5 
graduate  students,  and  7  undergraduate  students  drawn  from 
several  disciplines  including  design,  behavioral  sciences, 
computer  science,  and  robotics.  Because  of  the  wide  range  of 
expertise,  we  frequently  had  members  from  one  group  attending 
the  meetings  of  the  other.  For  instance,  the  designers  worked  on 
the  form  studies  but  they  often  interacted  with  the  engineers,  and 
everyone  helped  out  with  the  empirical  studies.  Organization  of 
this  group  was  assisted  through  the  use  of  an  on-line  forum  called 
the  Kiva  (www.thekiva.org),  hosted  on  a  website  accessible  to 
team  members  from  anywhere  on  the  Internet.  This  web  facility 
was  useful  because  all  of  the  information  was  organized  and 
presented  in  a  searchable,  threaded  format  to  the  entire  team. 

A  great  deal  of  emphasis  was  placed  on  good  documentation  of 
process,  code  and  interim  prototype,  so  any  new  person  on  the 
project  can  follow  in  the  footsteps  of  those  that  worked  on  it 
before. 

5.  SYSTEM  OVERVIEW 

To  give  the  reader  a  snapshot  of  what  the  design  process  has 
achieved  and  where  we  plan  to  go,  we  present  an  overview  of  the 
Snackbot.  The  Snackbot  robot  is  a  four-and-a-half-foot  tall,  semi- 
autonomous  semi-humanoid  robot  shown  in  Figure  1.  It  traverses 
the  hallways  of  our  buildings,  delivering  snacks  to  residents  in 
offices  and  labs.  The  Snackbot  will  have  its  own  “office”  where 
people  can  send  email  or  IM  for  ordering  snacks  (or  sending 
snacks  as  gifts  to  others).  We  also  plan  extensions  of  the  service. 
For  example,  the  Snackbot  might  visit  a  group’s  lounge  area  and 
invite  the  group  to  socialize  around  a  particular  snack. 

5.1  Hardware 

The  Snackbot  robot  is  based  on  the  existing  CMAssist  platform 
[27],  augmented  with  some  commercial  hardware  and  software 
and  new  elements  and  code.  The  Snackbot  uses  a  MobileRobots 
Inc.  Pioneer  3  DX  base  for  mobility.  Bumpers,  sonars,  and  a 
SICK  laser  are  used  to  detect  and  avoid  collisions  and  to  detect 


position  within  an  environment.  A  Hokuyo  URG  laser  is  mounted 
in  the  robot’s  chest  to  detect  potential  collisions  with  higher 
objects,  and  to  detect  people  by  torso. 

The  Snackbot  currently  has  non-functional  arms  that  hold  a  tray, 
used  for  carrying  snacks  (Figure  2).  The  tray  is  equipped  with  12 
load  cells;  each  is  capable  of  measuring  a  weight  range  of  13  to 
763  grams.  With  this  functionality,  the  robot  will  know  when 
someone  has  removed  or  replaced  a  snack  on  the  tray. 


Figure  2.  Sketch  of  the  Snackbot  tray,  showing  the  load  cell 
configuration. 

An  Acoustic  Magic  microphone  array  is  mounted  under  the  tray. 
It  serves  as  the  primary  audio  input  source  for  the  robot’s  natural 
spoken  language  and  dialog  processing  system.  The  robot’s  head 
is  mounted  on  a  Directed  Perception  pan/tilt  unit,  affording  a  360- 
degree  pan  range  and  a  111-degree  tilt  range.  A  Point  Grey 
Bumblebee  2  stereo  camera  is  mounted  behind  the  robot’s  eyes;  a 
monocular  Point  Grey  Dragonfly2  camera  is  mounted  on  the  top 
of  the  head  and  is  fitted  with  a  180-degree  fisheye  lens  from 
Omnitech  Robotics.  The  Snackbot  also  has  two  2.4GHz  Intel 
laptops  running  Ubuntu  Linux  for  data  processing. 

5.2  Software 

The  Snackbot  uses  MobileRobot’s  ARIA  API  that  works  with  the 
Pioneer  base.  ARNL  provides  functionality  for  map  construction, 
and  path  planning.  A  distributed  software  architecture  developed 
by  the  CMAssist  project  [27]  interfaces  with  the  behavior  control 
modules  and  the  speech  processing  interface.  When  the  Snackbot 
moves  through  its  environment,  it  will  track  its  current  position  by 
comparing  the  current  set  of  laser  scans  and  an  odometry  estimate 
against  a  previously  programmed  map. 

We  use  an  Augmented  Transition  Network  (ATN)  manager  for 
our  dialogue  system.  This  will  allow  for  a  flexible  discourse 


structure,  but  will  require  more  work  by  a  dialogue  designer.  We 
also  use  an  open  source  Sphinx4  speech  recognizer  system 
(http://cmusphinx.sourceforge.net/sphinx4/),  written  in  Java,  and 
the  Cepstral  speech  synthesizer  [20]. 

5.3  Form 

The  form  of  the  robot  is  made  of  cast  fiberglass  and  is  custom 
designed  to  fit  the  Pioneer  base  and  an  internal  structure  that 
anchors  the  laptop  and  other  components.  It  has  a  semi-humanoid 
form  and  uses  simple  geometric  shapes.  There  are  three  exterior 
pieces:  one  for  the  head,  one  for  the  torso,  and  one  for  the  base. 

5.4  Interaction 

There  are  several  basic  modes  of  interaction  with  the  robot.  In 
stationery  mode,  the  robot  is  positioned  in  a  social  space  and 
people  can  approach  the  robot  to  help  themselves  to  a  snack.  In 
roaming  mode,  the  robot  uses  the  map  to  visit  people’s  offices  and 
to  deliver  snacks.  Snacks  will  be  ordered  in  advance  (using  a  web 
page,  email,  or  IM)  or  selected  during  the  visit.  Customers  will 
register  on  the  Snackbot  website  and  will  get  points  for  snacks  in 
exchange  for  being  involved  in  the  research. 

To  interact  with  the  Snackbot,  people  eventually  will  engage  in 
natural  dialogue  with  the  dialogue  system.  Visual  feedback  will 
occur  through  an  LED  mouth,  which  will  indicate  when  the  robot 
is  “talking.”  Sound  will  be  used  as  an  additional  informational 
cue. 

6.  DESIGN  PROCESS 

To  holistically  conceive  of  the  robot  as  a  product  and  service,  we 
had  to  consider  many  aspects  of  the  design  process  concurrently: 
the  social  and  physical  context  of  the  environment  it  would 
operate  in,  its  form,  and  how  it  would  interact  with  people.  Table 
1  summarizes  our  design  activities  aligning  with  our  overarching 
design  goals  for  each  phase  of  the  project. 

6.1  Needs  Analysis 

We  conducted  needs  analysis  and  context  research  on  snacking  in 
our  office  buildings,  described  in  more  detail  elsewhere  [19].  Our 
environmental  research  took  the  form  of  a  campus  survey  to 
document  all  of  the  places  where  people  can  get  snacks.  From 
candy  dishes  in  administrative  offices  to  vending  machines  in  the 
basements  of  building,  we  mapped  site  lines  and  studied  each  site 
for  accessibility.  We  also  mapped  distances  to,  and  popularity  of, 
nearby  locations  that  are  popular  for  snack  breaks  —  for  instance, 
a  local  coffee  shop  that  is  frequented  by  members  of  the  campus 
community.  One  of  the  findings  from  this  work  was  that  people 
mainly  choose  convenience  over  snack  quality,  but  they  do  not 
mind  walking  for  a  snack  if  social  interaction  is  part  of  the 


Table  1.  Design  goals  and  design  activities. 


Design  activities 
/  Design  goals 

Needs  analysis 

Form  giving  & 
interaction  design 

Documentation  &  evaluation 

Develop  the  robot  holistically 

Context  research  on  snacking 

Form  research;  assess  tradeoffs 
in  material  and  technology 
selection 

Evaluative  field  studies  to 
understand  change  in  people, 
product  use,  physical  and 
social  context 

Develop  product  &  service 
simultaneously 

Site  survey  of  snacking 

Scenario  development;  trial  of 
delivery  service  with  human 
confederate 

Process  blueprint  for  robotic 
product  and  service  design 

Develop  interaction  designs 
that  evoke  social  behavior 

Understanding  of  physical, 
social,  psychological  reasons 
people  snack 

Dialogue  structure  study; 
height  and  approach  study 

Checklist  for  interaction 
design  considerations  in  HRI 

activity  (and  especially  if  the  snack  is  free).  Based  on  our 
observations,  we  created  two  basic  modes  of  interaction  for  the 
robot:  mobile  and  stationary.  We  decided  that  the  robot  in  mobile 
mode  should  offer  to  deliver  healthy  snack  choices  such  as  fruit, 
and  that  in  stationary  mode  should  offer  high  quality  snacks  in 
communal  locations  that  would  attract  groups.  These  decisions 
support  our  overall  design  goal  of  evoking  social  behavior,  and 
ensure  that  we  are  not  making  a  robot  that  will  only  bring 
fattening  snacks  to  sedentary  people. 


Figure  3.  Sketches  for  the  robot  housing:  a)  machine-like,  b) 
rounded  and  friendly,  c)  concepts  combined. 

6.2  Form  Giving  and  Interaction  Design 

Form  giving  and  interaction  design  encompass  all  of  the  activities 
necessary  to  generate  a  first  design  of  the  robot  and  service,  both 
in  terms  of  design  and  varied  studies  to  confirm  the  design.  In  this 
phase,  we  researched  and  generated  robot  forms,  and  also 
conducted  empirical  studies  to  evaluate  the  design  decisions  that 
we  made. 

Product  research  took  the  form  of  collecting  and  analyzing  images 
of  existing  social  robots,  which  ranged  from  animals  to  abstract  to 
humanoid  forms.  We  categorized  these  into  four  types:  humanoid, 
abstract,  semi-humanoid,  and  other.  Humanoid  robots  were  of 
interest,  because  they  mimic  the  anatomy  and  form  of  the  human 
figure.  Research  on  humanoid  robots  has  shown  that  they  are 
perceived  friendly  and  appropriate  for  tasks  that  involve  close 
interaction  with  people  [15][31][35].  However,  humanoids  are 
mechanically  complex,  and  for  our  research,  may  not  be  robust 
enough  for  long-term  use  in  the  field.  Abstract  robots  were  less 
relevant  because  they  have  a  mechanical  aesthetic,  showing 
tracks,  wheels,  and  other  parts  that  do  not  invite  human 
interaction  at  an  intimate  level.  Semi-humanoid  robots  were  of 
greatest  interest,  because  they  combine  simple  geometric  forms 
with  human  cues.  This  was  a  good  choice  for  further 
investigation,  as  the  housing  design  would  then  allow  for  the 
holistic  combination  of  hardware  and  aesthetic  components. 

We  generated  sketches  based  on  the  semi-humanoid  concept.  Two 
types  of  sketches  were  initially  explored:  more  industrial, 
mechanical  forms  with  wide  shoulders,  aggressive  stance,  and 
masculine  proportions,  and  more  playful,  cute  forms  with  rounded 
proportions  and  childlike  faces  (Figure  3).  To  support  our  goal  of 
social  interaction  by  making  the  robot  approachable  by  everyone, 


we  merged  these  two  styles  to  create  a  gender-neutral,  friendly, 
yet  professional-looking  form  to  fit  the  context  of  our  university. 

We  conducted  empirical  studies  to  investigate  and  support  our 
form  giving  process.  Here,  we  describe  four  of  them  as  examples: 
an  early  technology  feasibility  study,  an  early  interaction  study,  a 
dialogue  study,  and  a  height  and  approach  study.  Each  of  these 
studies  was  conducted  in  support  of  our  overarching  design  goals 
for  holistic  development,  product  and  service,  and  social 
behavior.  Each  generated  design  implications  for  our  robot  and 
tradeoffs  with  other  aspects  of  the  system.  The  process  and  results 
of  these  iterative  studies  are  described  in  this  section. 

6.2. 1  Early  Technology  Feasibility  Study 
We  assembled  some  of  the  robot’s  key  capabilities  on  an  existing 
mobile  robot  platform,  the  CMAssist  robot  [27].  The  goal  of  this 
study  was  to  test  and  verify  the  basic  functionality  of  the  major 
components  of  the  system,  and  to  ensure  that  it  would  work 
smoothly  with  the  wireless  network  in  our  buildings. 

The  robot,  partly  tele-operated,  traversed  hallways  for  five  two- 
hour  long  sessions  over  a  two-week  period,  in  the  two  campus 
buildings  described  above,  and  prompted  passersby  to  take  free 
snacks.  An  experimenter  using  a  joystick  about  20  meters  away 
controlled  the  robot.  The  dialogue  system  was  also  run  using 
streamed  audio  and  five  human-controlled  utterances,  allowing  us 
to  quickly  understand  the  timing  and  robustness  for  this  type  of 
dialogue  system.  To  help  our  technology  prototype  look  like  an 
aesthetic  robot  design,  we  created  a  housing  with  vacuum-formed 
materials  and  foam  core  components  (Figure  4). 

This  early  trial  helped  us  leam  about  many  tradeoffs  we  would 
face  in  the  future  design  of  the  hardware,  software,  and 
interaction  design  of  the  robot.  We  subsequently  decided  to  use  a 
commercially  available  base  for  the  Snackbot.  A  Pioneer  base 
would  be  more  reliable  than  a  home-built  base,  and  would  provide 
mobile  functions  that  would  be  easily  replicable.  It  would  also  be 
quieter  and  less  distracting  to  office  residents.  One  drawback  of 
using  this  kind  of  base  is  that  it  would  create  a  set  of  constraints 
for  the  final  industrial  design  of  the  robot  housing.  Such 
constraints  included  the  dimensions  of  the  robot,  the  availability 
(or  lack  thereof)  of  mounting  points  for  the  torso,  and  maximum 
load  that  could  be  carried.  Our  plans  for  the  torso  and  other 
electronics  exceeded  the  recommended  weight  limit  of  the 
Pioneer,  and  so  later  experiments  were  performed  to  leam  the 
maximum  reasonable  weight  the  robot  could  carry  while  still 
having  reasonable,  operational  battery  life. 


Figure  4.  Prototype  used  for  early  technology  feasibility  study. 

In  terms  of  software,  we  learned  that  it  could  be  feasible  to 
entirely  automate  the  dialogue  stmcture  using  a  finite  number  of 
preset  phases  because  conversation  with  the  robot  quickly 


revealed  stable  patterns  -  a  sequence  of  greeting,  selection  of 
snacks,  and  payment.  We  also  learned  that  we  would  need  to 
devise  ways  to  deal  with  network  lag  or  drop-off  and  still  preserve 
the  idea  of  a  sociable,  fluidly  interacting  robot.  This  led  us  to 
pursue  the  interaction  study  described  in  the  next  section. 

6. 2. 2  Early  Interaction  Study 

Our  early  interaction  design  study  took  the  form  of  three  semi- 
structured  trials  with  the  first  robot  prototype  in  two  campus 
buildings.  Here,  our  goal  was  to  come  up  with  archetypical 
dialogue  structures  for  interacting  with  the  Snackbot,  to  support 
our  design  goals  of  product  and  service  and  robust  social 
interaction. 

We  used  Wizard  of  Oz  methods,  where  a  remote  dialogue 
operator  used  Skype  and  interactively  “chatted”  with  snack 
customers.  A  separate  operator  performed  motor  control  of  the 
robot  using  a  joystick  and  tether.  We  adopted  the  convention  of 
American  ice  cream  trucks,  and  developed  a  30-second  melody 
and  a  cheery  “Hello!”  for  the  robot  to  announce  itself  in  the 
hallways.  Interaction  with  customers  was  structured  in  that  the 
Skype  operator  had  a  script  to  follow,  but  could  deviate  from  it  in 
real  time  if  needed. 

We  learned  that  people  found  the  melody  and  greeting  to  be  too 
annoying  for  use  in  an  office  building.  This  was  partly  due  to  the 
fact  that  the  sound  was  played  from  a  low-quality  speaker,  and 
therefore  distorted,  but  the  social  norms  of  an  office  environment 
also  played  a  role.  We  also  learned  that  a  minimal, 
straightforward  design  of  the  dialogue  would  be  all  that  is  needed, 
because  people  readily  filled  in  dialogue  and  other  social  cues, 
such  as  indicating  which  snack  that  they  intended  to  take  off  the 
tray  by  showing  it  to  the  robot’s  eye  cameras,  and  by  politely 
repeating  phrases  during  their  interactions.  These  findings 
suggested  methods  for  collecting  speech  and  environmental  sound 
as  input  for  the  dialogue  system,  and  gave  us  ideas  for  how  to 
specifically  design  and  study  the  dialogue  system,  which  we 
describe  in  the  next  section. 

6. 2. 3  Dialogue  Study 

We  next  conducted  a  study  to  verify  our  design  of  the  dialogue 
structures  and  scenarios.  Our  overarching  goal  was  to  discover 
how  to  provide  dialogue  with  the  robot  in  a  way  that  evokes 
social  behavior  and  allows  the  service  to  proceed  as  intended. 

We  created  general  dialogue  excerpts  and  ran  them  in  a  Wizard  of 
Oz  study  with  12  participants.  One  experimenter  ran  the  robot’s 
dialogue  scripts  in  a  remote  location,  and  another  noted  what  the 
participant  said  in  response  to  the  scripts.  We  used  the  stationery 
mode  as  a  scenario  for  the  study  — passersby  approached  the 
robot  and  discussed  what  snacks  were  available  that  day. 

We  learned  several  things  about  our  first  iteration  of  the  dialogue 
design.  First,  nearly  half  of  the  phrases  we  designed  were 
unsuitable  in  that  people  frequently  deviated  from  the  script  as  we 
designed  it.  We  added  phrases  to  control  for  unintelligible  speech 
or  users  wandering  off  topic.  We  also  learned  that  people  liked  to 
play  with  the  dialogue  structure  to  see  where  it  might  fail.  For 
example,  if  the  Snackbot  asked,  “Is  this  your  first  visit?”  a 
participant  might  answer  “I  have  been  here  lots  of  times  but  I 
have  never  seen  you,”  instead  of  giving  a  simple  yes  or  no 
answer.  Although  we  tried  to  structure  the  dialog  to  discourage 
such  behavior,  we  were  unsuccessful.  We  subsequently  added 
phrases  to  try  to  smooth  over  these  communication  breakdowns. 

We  found  that  care  needs  to  be  taken  in  constructing  the  output 


phrases  so  that  they  are  intelligible  and  imitate  human  intonation. 
Although  our  synthesizer  is  state-of-the  art,  certain  words, 
phrases,  and  spellings  can  result  in  difficult  to  understand  speech. 
The  synthesizer  has  trouble  particularly  with  the  rise  of  voice 
expected  when  people  ask  questions.  For  example,  “Would  you 
like  an  apple?”  sounds  strange  with  synthesized  speech 
intonation.  Thus  we  learned  the  Snackbot  should  instead  say,  “I 
want  to  know  if  you  would  like  an  apple,”  to  eliminate  intonation 
issues. 

We  found  that  some  participants  used  visual  cues  much  more  than 
others,  thereby  minimizing  the  use  of  dialogue.  In  particular,  they 
tended  to  examine  the  tray  rather  than  asking  what  snacks  were 
offered,  and  to  simply  remove  the  item  without  verbally 
indicating  what  they  would  like,  despite  a  direct  question.  We 
learned  that  we  would  need  to  tightly  couple  the  dialogue  system 
with  the  sensor  system  to  adequately  track  all  of  the  non-verbal 
communication  in  support  of  evoking  social  behavior. 

Other  interesting  social  interactions  were  observed,  such  as 
groups  of  people  interacting  with  the  robot.  Group  conversation 
presents  a  difficulty  for  the  speech  recognition  system,  which  is 
unlikely  to  differentiate  person-to-person  conversations  from 
those  targeted  towards  the  Snackbot.  Some  of  this  difficulty  can 
be  mitigated  with  careful  integration  with  other  sensors.  To  best 
understand  where  to  place  these  sensors,  we  undertook  a  height 
and  approach  study  described  in  the  next  section. 

6. 2. 4  Height  and  Approach  Study 

Rather  than  arbitrarily  deciding  the  height  of  the  robot,  we  wanted 
to  learn  whether  the  height  of  the  robot  affects  people’s  approach 
interactions  with  the  robot.  To  our  knowledge,  there  have  been  no 
formal  studies  about  the  body  size  of  a  robot.  Therefore,  we 
conducted  a  study  to  discover  what  an  appropriate  height  might 
be  for  the  Snackbot. 

We  conducted  a  between-subjects  experiment  with  72  participants 
using  the  technology  feasibility  prototype  described  earlier.  The 
robot  had  three  height  conditions,  44  inches  (112  centimeters), 
50.5  in  (128  cm),  and  56  in  (142  cm).  We  chose  these  three 
heights  as  deviations  from  the  average  height  of  a  small  human 
being  with  an  average  reach  of  lower  arm  length,  so  it  would  be 
comfortable  to  approach  and  take  a  snack  from  the  robot  even  in 
the  shortest  condition.  We  did  not  want  to  make  the  robot  taller 
than  people  in  order  not  to  be  threatening. 

The  study  was  conducted  in  a  public  area  of  our  campus.  We 
offered  free  snacks  for  participating  in  the  survey.  We  used  a  5- 
point  Likert  scale  to  understand  how  friendly  and  intelligent 
people  felt  the  robot  was,  and  how  they  responded  to  the  height  of 
the  robot.  An  open-ended  question  asked  participants  to  list  the 
personality  traits  that  they  ascribe  to  the  robot.  We  also  asked 
participants  their  gender,  age,  and  height. 

Using  a  5 -point  scale  where  1  =  much  too  small  and  5  =  much  too 
tall,  participants  preferred  the  tallest  robot  most,  F  [1,71]  =  4.10,  p 
<.02.  The  smallest  and  mid-sized  robots  averaged  2.4,  meaning 
they  were  between  “too  small”  (score  of  2)  and  “just  right”  (score 
of  3).  The  tallest  robot  was  almost  just  right  with  a  mean  of  2.9. 
Participants  liked  the  fact  that  they  could  make  eye  contact  with 
the  tallest  robot,  and  disliked  that  they  had  to  bend  to  interact  with 
the  smaller  two  robots.  There  was  no  difference  in  terms  of  how 
friendly  and  intelligent  people  felt  the  robot  was  across 
conditions.  In  addition,  no  correlation  was  observed  between  the 
participants’  height  and  the  robot’s  height  that  they  preferred. 
However,  there  were  interesting  differences  in  the  personality 


traits  participants  attributed  to  each  prototype.  The  smallest  robot 
most  frequently  was  described  as  servile,  obedient,  and 
submissive.  We  felt  that  to  best  support  our  goal  of  evoking  social 
behavior,  the  robot  should  be  seen  more  as  a  co-worker  than  as  a 
servant.  In  our  university  culture,  even  the  least  skilled  workers 
are  given  respect  more  as  peers  than  as  servants.  This 
consideration  also  indicated  that  the  tallest  robot  would  be  most 
appropriate. 

Armed  with  the  findings  from  the  early  technology  feasibility 
study,  the  early  interaction  study,  the  dialogue  study,  and  the 
height  and  approach  study,  we  built  the  second  prototype  of  the 
Snackbot. 

7.  SECOND  PROTOTYPE 

We  embarked  on  designing  a  more  robust,  refined  system,  using  a 
Pioneer  base.  This  decision  was  made  to  support  our  design  goals 
of  offering  a  product  and  service  in  our  office  environment,  by 
reducing  the  distracting  noise,  and  ensuring  operation  over  long 
periods  of  time  and  a  variety  of  floor  types. 

From  our  interaction  study,  we  learned  that  we  would  need  to 
develop  a  set  of  sensors  that  would  allow  us  to  know  when  a 
snack  was  taken.  Because  we  did  not  want  to  overload  the  vision 
system,  which  would  eventually  support  person  recognition,  we 
added  a  mid-chest  laser  and  pressure  sensors  to  the  robot’s  tray. 
These  additions  would  also  support  natural  social  behavior 
between  Snackbot  and  its  customers. 

To  develop  the  second  prototype,  we  focused  on  the  development 
of  the  housing,  the  design  of  the  tray,  and  the  development  of  an 
internal  structure  to  anchor  the  sensors,  laptops,  and  housing  to 
the  Pioneer  base.  We  also  focused  on  the  expressive  qualities  of 
the  robot’s  face,  and  finalizing  the  interaction  design. 

7.1  Housing 

Working  from  the  early  sketches  described  above,  we  built  a 
number  of  quarter-scale  and  half-scale  models  of  the  robot.  After 
ascertaining  correct  proportions  for  the  dimensions  of  the  Pioneer, 
laptops,  and  other  internal  components,  we  constructed  a  full- 
scale  mock-up  to  proportions,  radii,  and  design  details  (Figure  5). 
Using  this  prototype,  our  design  team  was  able  to  address 
dimensions,  hardware  placement,  configurations,  assembly,  and 
tray  size  and  arm  options. 


Figure  5.  Full-scale  mock-up  of  the  robot. 

To  check  responses  to  the  full-scale  model,  we  placed  it  in  a 
hallway  in  our  building  and  conducted  a  survey  with  59 
participants  to  understand  positive  or  undesirable  associations  to 


the  design.  Participants  rated  the  robot  as  friendly  and  likable 
(mean  3.88  and  3.87,  a  5-point  scale),  and  neither  intelligent  or 
unintelligent  (mean  3.42).  The  robot  evoked  descriptions  of 
service  jobs  such  as  a  waiter  or  waitress,  or  general  Sci-Fi 
characters  such  those  from  the  Jetsons.  Based  on  these  responses, 
we  felt  that  the  final  form  design  supported  our  three  design 
goals. 

One  of  the  issues  with  the  housing  was  weight.  The  Pioneer  has  a 
recommended  payload  limit  of  50  lbs.  for  carrying  additional 
weight.  From  our  experiments,  we  determined  that  between  70 
and  80  lbs.  of  weight  was  still  reasonable  for  the  robot  to  carry 
and  still  have  an  acceptable  operational  lifetime.  This  drove  the 
selection  of  light  materials  such  as  fiberglass  and  neoprene  fabric 
for  the  outer  housing  and  aluminum  80/20  for  the  inner  housing. 
We  also  segmented  the  base  and  made  a  variety  of  cuts  in  the 
torso  to  reduce  weight.  The  resulting  housing  is  lightweight, 
strong,  and  easy  to  add  attachments  for  internal  materials. 

After  generating  a  number  of  color  studies  for  the  housing,  we 
selected  a  color  scheme  of  medium  gray  and  orange.  Both  hues  do 
not  cause  gender  attributions  or  strong  attributions  of  service  type 
in  the  U.S.  culture.  For  example,  a  blue  robot  might  connote  a 
medical  service,  due  to  the  ubiquitous  use  of  the  color  blue  in  the 
health  sector.  A  green  robot  might  connote  a  sustainable  product. 
Orange  is  also  often  associated  with  food  and  restaurants. 
Together,  the  orange,  gray,  and  dark  gray  of  the  neoprene  creates 
a  distinctive,  impressive  form. 

7.2  Tray 

The  tray  design  (Figure  2)  was  developed  for  providing  food  or 
snacks  at  the  appropriate  delivery  height,  but  also  as  an  input 
system  for  measuring  the  weight  and  presence  or  absence  of  items 
on  the  tray.  The  tray  has  movable  slots  that  can  be  configured  in  a 
number  of  ways  to  hold  different  snacks.  The  tray  is  made  of 
aluminum,  styrene,  12  load  cells,  and  a  cloth  covering  on  which 
snacks  will  be  placed. 

7.3  Internal  Structure 

The  design  for  the  internal  structure  continued  to  evolve  as  the 
external  housing  design  was  finalized.  To  minimize  weight  while 
providing  maximum  strength,  three  vertical  struts  of  extruded 
aluminum  were  used  as  the  base  for  the  design.  We  augmented 
these  with  a  custom  aluminum  plate  at  the  top  of  the  Pioneer  base 
and  one  at  the  shoulder,  to  mount  internal  components.  These 
additions  allowed  for  retrofitting  to  a  variety  of  components  using 
off-the-shelf  brackets  and  anchors.  These  design  decisions  afford 
modularity,  which  support  our  overall  goal  of  holistic  design. 

7.4  Head  and  Face 

Our  overall  goal  was  to  create  an  expressive  head  that  would 
serve  as  a  locus  of  interaction,  relying  on  appropriate  features  that 
convey  the  right  level  intelligence  and  functionality  to  the  robot 
[10]. 

The  final  head  design  features  a  simple  form  that  is  wider  than 
tall,  suggesting  a  young,  friendly  robot.  The  width  of  the 
Bumblebee  camera  also  determined  the  width  of  the  head  and  the 
placement  and  size  of  the  eye  sockets.  We  also  felt  that  by 
minimizing  complexity  and  detail  in  the  eyes,  Snackbot  customers 
would  not  develop  false  perceptions  about  the  intelligence  of  the 
robot. 

A  3  x  12  LED  display  was  developed  for  the  mouth,  serving  as  an 
expressive  focus  for  interaction.  The  mouth  is  programmed  with  a 


series  of  animations  that  show  verbal  and  emotional  feedback  in 
the  form  of  lip  shapes,  colors  and  movement.  Although  the  robot 
does  not  have  functional  ears,  we  added  ears  to  the  head  design, 
so  that  customers  would  understand  that  the  robot  could  hear 
them. 

7.5  Interaction  Design 

The  final  interaction  design  for  the  robot  includes  stationary  and 
mobile  delivery  modes  that  provide  a  variety  of  services  to  our 
university  community  and  support  social  behavior.  We  have 
designed  a  basic  interaction  infrastructure,  so  we  can  use  and  vary 
these  modalities  to  conduct  experiments  once  the  robot  is  fully 
implemented.  Snacks  can  also  be  ordered  through  a  web  site  or 
IM  service.  The  interaction  design,  of  course,  will  evolve  as  we 
conduct  field  trials  with  the  Snackbot. 

8.  LESSONS  LEARNED 

We  have  spent  almost  two  years  on  the  holistic  development  of  a 
robotic  system,  and  we  have  learned  several  lessons.  We 
articulate  them  here,  relative  to  our  overall  design  goals. 

Our  first  lesson  was  to  understand  how  the  robot  would  actually 
work  in  a  context  of  people,  other  products,  a  physical 
environment,  and  social  norms.  Then,  in  the  service  of  holistic 
design,  the  design  of  particular  subsystems  could  be  undertaken. 
This  point  is  not  new.  Many  others  have  articulated  the  need  to 
design  for  the  context  (e.g.,  Jones  and  Hinds  [17]). 

Our  second  lesson  in  terms  of  holistic  design  was  to  design  for 
modularity.  Functions  should  be  developed  individually,  but  with 
an  eye  to  the  constraints  caused  by  other  aspects  of  the  system. 
Modularity  also  means  that  components  can  be  upgraded  or 
changed  as  new  and  better  systems  become  available.  For 
example,  the  selection  of  a  Pioneer  base  created  weight 
constraints,  which  became  an  issue  in  the  design  of  the  housing. 
Again,  others  in  various  fields  have  recommended  designing  in 
modularity  (e.g.,  Cai  and  Sullivan  [7]). 

In  terms  of  product  and  service,  we  learned  the  robot  should  offer 
capabilities  that  add  value  to  people’s  lives,  and  allow  them  to 
add  value  themselves  through  interacting  with  the  robot.  This  idea 
drove  our  choice  to  offer  healthy  snacks,  and  to  provide  a 
stationary  mode  that  invites  people  to  take  a  walk  to  the  robot. 
Future  experiments  will  be  done  to  understand  whether  and  how 
the  robot’s  interaction  design  can  be  modified  to  best  support 
people. 

In  terms  of  social  behavior,  we  learned  to  work  to  make  a  social 
robot  sociable  within  the  limitations  of  current  technology.  For 
example,  we  needed  to  make  iterative  changes  to  the  dialogue 
system  to  both  support  fluid  and  natural  social  interaction  while 
working  with  the  constraints  that  the  wireless  network  provided. 

None  of  these  lessons,  taken  independently,  are  new,  as  the  HRI 
community  will  recognize.  What  we  think  is  a  contribution  is  our 
showing  how  we  tried  to  tackle  all  these  lessons  together.  The 
larger  lesson  is  that  designing  for  all  these  goals  is  what  is  really 
hard.  It  requires  a  design  team  dedicated  to  an  interdisciplinary 
holistic  design  process. 

9.  CONCLUSION  AND  FUTURE  WORK 

In  this  paper,  we  have  presented  the  design  and  development 
process  for  the  Snackbot  robot,  a  robot  that  is  designed  for  long¬ 
term  delivery  of  snacks  in  our  building.  We  had  three  overarching 
goals  for  the  development  of  this  system:  to  develop  a  robot 
holistically,  to  develop  the  robot  as  a  product  and  a  service  within 


our  building,  and  to  design  the  robot  in  a  way  that  evokes  social 
behavior. 

Our  future  research  will  assess  the  success  of  the  design  relative 
to  our  design  goals  through  extensive  field  trials  to  understand 
change  in  people,  product  use,  physical  and  social  context.  As  the 
interaction  and  service  design  of  the  robot  are  flexible,  we  will 
conduct  experiments  by  changing  the  form  and  the  behavior  of 
the  robot.  Few  examples  include  experiments  on  how  the  robot 
might  use  music  and  nonverbal  behavior  instead  of  words  to 
communicate  with  customers  at  their  offices,  and  experiments  on 
personalized  services. 
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